arXiv:1503.06870vl [cs.SI] 23 Mar 2015 


The Lifecycles of Apps in a Social Ecosystem 


Isabel Kloumann 
Cornell University 
Ithaca, NY 

imk36@cornell.edu 


Lada Adamic 
Facebook Inc. 
Menlo Park, CA 
ladamic@fb.com 


Jon Kleinberg 
Cornell University 
Ithaca, NY 

kleinber@cs.cornell.edu 


Shaomei Wu 

Facebook Inc. 
Menlo Park, CA 
shaomei@fb.com 


ABSTRACT 

Apps are emerging as an important form of on-line content, and 
they combine aspects of Web usage in interesting ways — they ex¬ 
hibit a rich temporal structure of user adoption and long-term en¬ 
gagement, and they exist in a broader social ecosystem that helps 
drive these patterns of adoption and engagement. It has been dif¬ 
ficult, however, to study apps in their natural setting since this re¬ 
quires a simultaneous analysis of a large set of popular apps and 
the underlying social network they inhabit. 

In this work we address this challenge through an analysis of the 
collection of apps on Facebook Login, developing a novel frame¬ 
work for analyzing both temporal and social properties. At the tem¬ 
poral level, we develop a retention model that represents a user’s 
tendency to return to an app using a very small parameter set. At 
the social level, we organize the space of apps along two funda¬ 
mental axes — popularity and sociality — and we show how a 
user’s probability of adopting an app depends both on properties 
of the local network structure and on the match between the user’s 
attributes, his or her friends’ attributes, and the dominant attributes 
within the app’s user population. We also develop models that show 
the importance of different feature sets with strong performance in 
predicting app success. 

Categories and Subject Descriptors: H.2.8 [Database Manage¬ 
ment] : Database applications —Data mining 
Keywords:apps; diffusion; social networks 


1. INTRODUCTION 

There is, or is likely soon to be, a webservice or app for virtually 
every component of modern life. They are diverse and ubiquitous; 
they constitute both a backdrop and chronicle of everyday expe¬ 
rience. And they represent a broad change in overall patterns of 
Internet use — both the research community and the media have 
increasingly begun discussing the “appification of the Web”[] Yet 
empirical opportunities to consider them as a complete ecosystem 
have been limited, and as a result we still know very little about 

1 https://sites.google.com/site/appweb2 012/ 
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the population structure of apps — their inherent diversity, their 
lifecycles, and the ways in which users engage with them. 

The high-level characteristics of app engagement as a form of 
Web use are still the subject of much discussion and refinement, but 
certain properties emerge independent of any one particular app’s 
functionality — these include temporal properties, based on long- 
running patterns of individual usage and engagement over time, and 
social properties , in which an individual will typically be a user of 
many apps with overlapping functionality, in a broader social envi¬ 
ronment that is bootstrapped to create within-app social activity. 

To address these issues, we study the collection of apps on Face- 
book Login, making use of anonymized aggregate daily usage logs 
of the apps and web services accessible through this mechanism. 
We undertake our analysis on two levels of scale — the individ¬ 
ual level, focusing on the properties of user behavior over time and 
in relation to other users; and the app level, modeling the overall 
usage level of the app and the social structure on its users. 

At the temporal level, we develop a user retention model, show¬ 
ing how with a small number of parameters we can approximate 
the probability that a user who adopts an app at time t will con¬ 
tinue to be using it at a future time t + A. The model exposes the 
ways in which usage decay has a time-dependent component, and 
provides us with a compact set of parameters representing a partic¬ 
ular app’s engagement profile that can then be used in higher-level 
tasks. When we consider the app’s user population as a whole, we 
are led to natural lifecycle modeling and prediction questions — 
given an app’s history up to a given point in time, how well can we 
predict its number of users going forward? Interesting recent work 
of Ribeiro G3 considered this question using time-series data for 
several large Web sites; we show how a broad range of feature cate¬ 
gories — including our derived retention parameters, together with 
individual characteristics of the app’s users and the social network 
structure on its full user population — can lead to strong prediction 
results across a wide diversity of apps. 

At the level of app social structure, we show how the space of 
all the popular apps on Facebook Login can be organized in a two- 
dimensional representation whose axes correspond to popularity — 
the raw number of users — and sociality — the extent to which 
users of the app have friends who are also users of the app. This 
representation exposes certain global organizing principles in the 
full app population, including a pair of complementary “frontiers” 
to the space — one containing apps whose sociality is relatively 
fixed independent of their popularity, and one in which the social¬ 
ity of the app’s user population is not much greater than that of a 
random set of Facebook users of comparable size. 

Finally, we perform an analysis of social characteristics at the in¬ 
dividual user level, analyzing the Facebook users who are one step 
away from an app in the social network — a set we can think of 



as the “periphery” of the app, containing people who are not yet 
users of the app, but have friends who are users. For a person in 
an app’s periphery, we can attempt to predict future adoption of 
the app based on individual characteristics and network structure. 
We find that apps are diverse in the way in which the structure on a 
user’s friends is related to adoption probabilities, and we find an in¬ 
teresting effect in the interactions among individual characteristics: 
a user’s probability of adopting an app depends on the three-way re¬ 
lationship among their own attributes, the attributes of their friends 
who use the app, and the modal attributes of the full population of 
app users. 

2. DATA 

The data for this study comes from anonymized logs of Face- 
book Login daily activity, collected between January 2009 and June 
2014. Facebook Login is a secure way for Facebook users to sign 
into their apps without having to create separate logins. 

The various analyses in this paper required different slices of 
these logs, both considering the observation window and the apps 
being observed. Table 1 summarizes the different subsampled data 
sets that will be referred to throughout this work. The data for 
this study has granularity of one day; that is, we have logs about 
whether an individual uses a specific app on each day. All user 
level data is de-identified. 


tag 

selection 

criteria 

time period 

size 

APPSrand 

random oc 
DAU(2014-06) 

Jan. 2014 - 
Jun. 2014 

83,000 apps 

APPS pop 

most popular by 
MAU(2013-06) 

Jan. 2009 - 
Jun. 2014 

2,319 apps 

1.4 x 10 9 users 


Table 1: Summary of data sets considered in this paper. DAU 
and MAU refer to Facebook Daily Active Users and Monthly 
Active Users, respectively. Our initial overview analyses con¬ 
sider APPS rand, while our subsequent and in-depth analyses 
consider APPS po p and, as occasion permits, various subsets 
of it (APPS pop{x})* Unless otherwise noted, subsampling in 
this work is done on apps, not on users. 

The frequency with which the Facebook Login service is called, 
and hence daily activity is registered, depends on several factors. 
Web-based activity relies on authentication tokens that expire on 
the order of hours, while mobile apps can optionally request to¬ 
kens that are valid for days, provided the user does not change their 
password. For some apps, we do see a periodic activity, typically 7 
days apart, consistent with longer-term authentication tokens being 
refreshed. This periodicity is a small effect relative to the overall 
activity, as we show below. This is likely because other activity, 
such as posting updates or retrieving public profiles or friend lists, 
again requires reconnecting. Therefore, Facebook Login provides 
a reasonable proxy of daily use of the app. It allows us to charac¬ 
terize the app’s adoption and retention. 

3. SOCIAL PROPERTIES OF APPS 
3.1 Popularity and sodality 

One question that has been raised previously is how big of a role 
the social network plays in the adoption of apps. This parameter 
has been inferred indirectly by Onnela and Reed-Tsochas G3 in 
their study of the very early adoption of Facebook apps. It is also 


estimated in the model proposed by Ribeiro p~8) , where individu¬ 
als can drive their friends’ adoption and re-engagement. 

However, these prior studies did not directly measure whether 
app adoption was in fact correlated on the network, and so we turn 
to this task presently. In particular, we would like to place apps in 
a low-dimensional space that can provide a view for how they are 
distributed across the social network of users. 

To do this, we begin with two basic definitions 

• We say that the popularity of an app, denoted p(x), is the 
probability that an individual selected uniformly at random 
from Facebook’s population is a user of the app. 

• We say that the sociality of the app, denoted p(x\y), is the 
probability that a member of Facebook is a user of the app 
given that they have at least one friend using the app. 

Studying the distributions of p(x) and p(x\y), and how they are 
jointly distributed across apps, allows us to ask a number of ques¬ 
tions. In particular, how socially clustered is the app? And how 
does it depend on the type of app, or characteristics of the app’s 
users? 

Note that if p(x\y) is very high for an app, it means that its user 
population in a sense “conforms” to the structure of the underlying 
social network. 

Moreover, p(x\y) can in principle be high even when p(x) is low 

— this would correspond to an app that is popular in a focused set 
of friendship circles, but not on Facebook more broadly. On the 
other hand, if p(x\y) is not much more than p(x), then it says that 
users of the app are spread out through the social network almost 
as though each member of Facebook independently flipped a coin 
of bias p{x) in order to decide whether to become a user of the app 

— there would be no effect of the social network at all. 

Plotting apps in popularity-sociality space 

An appealing feature of this pair of parameters is that it provides 
a natural two-dimensional view of the space of all popular apps on 
Facebook. We show this view in Figure [T] — a heat map show¬ 
ing the density of apps at each possible (discretized) pair of values 

(p(x),p(x\y)). 

We see in Figure^ that the apps fill out a wedge-shaped region 
in the p(x)-p(x\y) plane, and it is informative to understand what 
the boundaries of the region correspond to. First, note that if the 
social network had no relationship to app usage, we would see the 
diagonally sloping line p(x) — p(x\y ); in the plot this corresponds 
to a line that lies slightly below the diagonal lower boundary of the 
points in the heat map. Thus, there exists a frontier in the space of 
apps that is almost completely asocial — those apps that lie paral¬ 
lel to this diagonal line — but essentially no apps actually reach the 
line; even the most asocial apps exhibit some social clustering. We 
see this in the approximately horizontal top boundary of the points 
in the heat map — this is a frontier in the space of apps where 
knowing that a person x has a friend using the app gives you a fixed 
probability that x uses it, independent of the app’s overall popular¬ 
ity on Facebook. The location of this horizontal line is interesting, 
since it provides an essential popularity-independent value for the 
maximal extent of social clustering that we see on Facebook. 

Note that the wedge-shaped region in a sense has to come to a 
point on the right-hand side, as p(x) becomes very large: once an 
app is extremely popular, there is no way to avoid having pairs of 
friends using it almost by sheer force of numbers. And given the 
crowding of app users into the network, there is also no way for the 
extent of social clustering to become significantly larger than one 
would see by chance. 
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Figure 1: App sociality. Top left: Horizontal axis is app 
popularity, and vertical axis is the relative increase in adop¬ 
tion likelihood for people who have friends who also use the 
app. Right panels: Horizontal axis is app popularity, verti¬ 
cal axis is app sociality. The colors represent the number of 
apps falling within the given bin. The lower right panel uses 
APPS rand, while the other three panels use APPS pop (see 
Table [l]for details). The labeled colors indicate the relative fre¬ 
quencies of observations in each bin, such that the lowest values 
have been normalized to 1. Bottom left: Matrix indicating the 
p-values for the two sided Kolmogorov-Smirnov test compar¬ 
ing the distributions p(x\y)/p(x) for apps within each pair of 
the nine listed categories. White indicates a lower p -value and 
black indicates a higher one. 


The third, lower left boundary of the wedge is a manifestation of 
the Facebook friend limit of 5000: if a user is friends with some¬ 
one who uses the app, at least 1/5000 of their friends use it. We 
approach this limit in the far left hand of this Figure with apps that 
have two users, one with 0 other friends using the app and the other 
with 1 of their nearly 5000, combining to approach the lower limit 
of 1/(2* 5000). The lower bound decreases as 1/ (n * 5000) as the 
number of users, n, increases. 

3.2 Analysis of Social Neighborhoods 

We saw in the previous section that app adoption can be local¬ 
ized in the social network. But what is the mechanism by which 
the app is adopted by friends? Is homophily driving both friend¬ 
ships and adoption of specific apps based on interests? Or is the 
primary mechanism one of exposure, where having even just a sin¬ 
gle friend who has installed the app now gives an opportunity to 
become familiar with the app and subsequently adopt it? 

To answer these questions, we observe the Facebook friendship 
graph at time t. For each app, we consider everyone on Facebook 
who has friends using the app, but who has not used the app them¬ 
selves by t. Are there features of the individual and their friends 



Figure 2: Relationship between national identity of potential 
app adopters, that of their current user friends, and the likeli¬ 
hood of their adopting the app. Blue: Relative adoption rates 
when a potential user and their current user friend are from 
the majority to when potential user is in majority and current 
user friend is in minority. Green: Relative rates when poten¬ 
tial user and current user friend are in same minority to when 
potential user is in minority and current user friend is in ma¬ 
jority. Red: Relative rates for when potential user and current 
user friend are in same minority to when they are in different 
minorities. The blue curve indicates that when a potential user 
is from the majority country, their current user friend could be 
from either the majority or a minority and they are still equally 
likely to adopt an app more often as less. In contrast, when the 
potential user is from a minority country, in 75% of apps they 
adopt more frequently when their current user friend is from 
the same country as them. 


that will predict whether the individual will adopt the app at some 
point in the future? 

One-node neighborhoods 

We begin with a question about homophily and its relation to app 
usage. Consider a Facebook user A who does not currently use the 
app, and suppose that has exactly one friend B who uses the app 
(that is, A has between 1 and 5000 friends on Facebook, but for 
our purposes here, exactly one of those friends is an app user). We 
choose some attribute on users (for example nationality, or age); we 
let f(A) and f(B) denote the value of this attribute for A and B , 
and we let /* denote the modal (or median) value of the attribute 
across all app users. 

Now the following question arises. Suppose that A is different 
from the typical user in this attribute, in the sense that f(A) / /*. 
Is A more likely to adopt the app if the friend B is similar to A, or if 
B is similar to the typical app user? This is a basic question about 
the role of individual similarity in adoption decisions — if we’re 
studying potential users who are outside the target demographic, is 
it more effective for their app-using friends to be similar to them, 
or similar to the target demographic? 

We study this for nationality as an attribute in Figure [2] — given 
an app, we index nationalities so that the most common national¬ 
ity among users of the app is labeled 0, and other nationalities are 
labeled by values i > 0. We now say that a(i,j) is the adop¬ 
tion probability of a user A who has one friend B using the app, 
with f(A) = i and f(B) = j. (Note that /* = 0 according to 
our notation, since 0 is the most prevalent nationality in the app.) 
The figure shows that for a considerable majority of apps, we have 
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Figure 3: The probability that a user adopts the app given that 
they have one friend using the app. as a function of (left) the 
friend’s age offset from the median and (right) the user’s age 
offset from the median. The left plot indicates no apparent rela¬ 
tionship between the age of the friend and that the user adopts. 
In contrast, the right plot illustrates that young users and users 
who are aged between 10 and 30 years above the median age are 
more likely to adopt. Users who are more than 40 years older 
than the median age are less likely to adopt. The probabili¬ 
ties were binned by age into 20 equally populated bins and the 
reported adoption probabilities are bootstrap estimates. The 
thick central line is the median bootstrap estimate of the mean, 
while the three outer bands indicate the 68%, 95%, and 99.7% 
confidence-intervals. 


a(i, 0) > 1, indicating a clear aggregate tendency for the 

question in the previous paragraph: a user A is more likely to adopt 
the app in general when A ’s one friend using the app is similar to 
A, not to the typical app user. In contrast, when f(A) = /*, the 
ratio a(0, 0)/a(0, i) is balanced around 1, so there is no clear ten¬ 
dency in adoption probabilities between the case f(B) = /* and 
f(B) ^ /*: for users who have the modal attribute value, the 
attribute value of their friend does not have a comparably strong 
effect. 

This style of question gives us a way of analyzing individual 
attributes in general, and we find that attributes differ in the way this 
effect manifests itself. For example, when we consider age as an 
attribute (in place of nationality), we see (Figure [ 5 } that if A has a 
friend B using the app, the age of B has very little correlation with 
A ’s probability of adopting. However, the age of A is related to the 
adoption probability — users A who are much older or younger 
from the median age are relatively more likely to adopt the app if 
they have a friend who uses the app, compared to individuals who 
are near the median age themselves. 

Two- and three-node neighborhoods 

We can also use the population of apps, and the adoption decisions 
that people make about them, to address a recurring recent question 
in the literature on on-line diffusion. Given an individual A who is 
not currently using an app, but who has k friends B \, B 2 , •.., Bk 
using it, how does A’s adoption probability depend on the pattern 
of connections among these k friends? Is A more likely to adopt if 
there are many links among these friends, or very few? 

Past work has suggested that the answer can depend on the adop¬ 
tion decision being studied. Consider the results observed by Back- 
strom et al. (lj with Live Journal data, where conversion probabil¬ 
ity increases with the connectedness of one’s friends, and contrast 
them with those observed by Ugander et al. |23] with Facebook e- 
mail invitations, where, for a fixed neighborhood size, one’s proba¬ 


bility of conversion strictly increased with the number of indepen¬ 
dent components. In both cases the result is non-obvious, as there is 
no a priori mathematical reason that the effect should be monotone 
with connectedness of one’s neighbors. 

Given the diverse answers arising in prior work, and the conse¬ 
quent suggestion that the result depends on the adoption decision, 
we consider how adoption probabilities vary with neighbor connec¬ 
tivity across a large sample of the most popular apps. 



Figure 4: Right: the baseline rate of adoption given that a user 
has two friends using (horizontal) and the ratio of probability of 
adoption given friends are connected to probability given that 
friends are not connected (vertical). Apps above the line y — 1 
exhibit the same trend as Livejournal adoptions, and those be¬ 
low follow the trend observed in Facebook adoptions. Left: 
Closed to open conversion ratio for two-node neighborhoods 
(horizontal) and three node neighborhoods (vertical). Apps in 
the upper right quadrant follow the Livejournal trend for two- 
and three-node neighborhoods, while apps in the lower left fol¬ 
low the Facebook trend. The correlation between these rates is 
0.98 with p « 0.01, and there is a stark deficiency of apps in 
the diagonal quadrants. 

We begin by considering the question just for two-node user 
neighborhoods, asking it separately for six hundred apps from 
APPS pop- given that a non-user A has exactly two friends using 
the app at time t , how does A ’s adoption probability depend on 
the presence or absence of an edge between A ’s two friends? Note 
again that these users may have any number of Facebook friends 
between 2 and 5000, but that only two of those friends can be app 
users. We explore this question in Figure [4] We find that both pos¬ 
sibilities — higher or lower adoption probability with the presence 
of an edge — occur in roughly comparable proportions across the 
population of apps, suggesting that at the level of two nodes, both 
possibilities are indeed prevalent. 

As often happens in the analysis of phenomena on small social 
subgraphs, we begin to see some rich structure emerge in the ques¬ 
tion when we move to three-node neighborhoods. For a graph G, 
we let a(G) denote the adoption probability of user A when A’s 
neighbors induce the graph G, and note that on three nodes there 
are four possible graphs: the complete graph K 3 , the three-node 
path Ps, the single-edge graph P 2 U Ei, and the empty graph E 3 . 

We find (Figure [4]) that the ratio a^Kz)/a^Ez) covaries closely 
with a(K 2 )/a(E 2 ) across the set of apps — in other words, when 
the adoption probability of an app is higher for a connected pair of 
friends, it is also higher for a connected triplet of friends. This in¬ 
dicates how properties of two-node neighborhoods provide strong 
information about the properties of larger neighborhoods — and it 
is an empirical regularity of the adoption decisions rather than a 
strictly mathematical one, since the property for three nodes does 
not follow from the property on two nodes. We find similar regu- 
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Figure 5: Two views of the same 3D point cloud: 

apps positioned according to the ratios a(E$) /a(iT 3 ), 
a(P 2 U AT)/a(iT 3 ), and a(P 3 )/a(iT 3 ), and colored such 
that blue apps have a(K 2 )/a(E 2 ) > 1, and red have 

a(K 2 ) / a(E 2 ) < 1. All adoption rates are reported relative to 
the rates for when a friend’s user friends are a clique. Red 
apps have higher adoption rates with lower connectivity in the 
two-node graphs, and we see a near perfect correspondence in 
this trend for each possible combination of connectivity in three 
node graphs; this is demonstrated by the blue and red points 
falling naturally on either side of 1 in all three dimensions. In 
the left hand view the vertical extent of the cloud demonstrates 
the natural variation in relative adoption rates when a friend’s 
user friends form two components (y-axis) compared to three 
(x-axis). In contrast, in the right hand view of this point cloud 
we observe more limited variation in relative adoption rates 
when the user friends are connected in one component. These 
differences are reflected in the Pearson correlation coefficients 
of 0.94 between a(P 3 )/a(iT 3 ) and a(P 2 (J AT)/a(iT 3 ), and 
0.57 between a(P 3 )/a(iT 3 ) and a(P 3 )/a(iT 3 ) (p < 1CT 10 in 
both cases). 


larities in the fact (Figure [5j that when adoption probabilities are 
higher for a(P 3 ) relative to a(K 3 ), they are also higher for the 
next-sparsest graph a(P 2 U Pi). And we see comparatively much 
less variation between a(P 3 ) and a(P 3 ), which is consistent in 
interesting ways with the finding of Ugander et al. (23) that neigh¬ 
borhood topologies inducing the same number of connected com¬ 
ponents tended to lead to similar adoption probabilities. 

4. TEMPORAL PATTERNS IN APPS 

In addition to learning how the adoption of apps depends on 
friendship ties, we’d like to characterize the app’s ability to retain 
those users who have adopted. These temporal features of an app’s 
evolution hold some of the keys to its success. 

To get a sense for what the retention of users looks like at a global 
level, we show the evolution of usage for a sample of apps in Fig¬ 
ure [6] The images in this figure are heat maps in which the cell in 
the (z, j) entry records the density of users who first used the app at 
time i and last used it at time j. These maps thus show periods of 
heavy recruitment as horizontal bands — days when many individ¬ 
uals first started using the app. While there are a few vertical bands, 
denoting a narrow period of time when many users were last active, 
there is a clear concentration along the diagonal of many users de¬ 
parting soon after their first login, while some, located off-diagonal, 
remain active much longer. There are also sudden increases and de¬ 
creases in density, as apps became more or less popular. With this 
type of global view of the diversity in usage and retention patterns, 
we next turn to modeling these retention patterns; our goal is to ex¬ 
tract parameters from such a model to use as features in predicting 
an app’s future engagement. 



Figure 6: App adoption and departure dynamics. Heat maps of 
aggregate first and last login times for users of several examplar 
apps. The y-axis corresponds to the date of the first login and 
the x-axis to the last. The concentration is from blue (few) to 
yellow (many). Bright yellow and green horizontal or vertical 
bands correspond to periods of rapid adoption and departure, 
respectively. The color scale increases in density from white to 
blue, then yellow, then red. 


4.1 Retention model 

For an app to have long term success we expect that it needs to 
maintain a relatively high level of user retention. We would like 
to have a model of retention that characterizes not only whether an 
individual will log into the app the very next day, but for any day 
subsequent to their first login. 

Past work on retention modeling (e.g. wm> has been fo¬ 
cused on a particular product/activity (mostly online games), try¬ 
ing to predict users’ continous engagement with a wide selection 
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Figure 7: Empirical retention data, model predictions, and pa¬ 
rameters. (a,i): Retention data and model predictions for an ex¬ 
emplar app. Error bars on the data (red sol id curve) rep resent- 
ing 99.999% confidence intervals (4.4172^(1 — p)/n). (a,ii): 
Error corresponding to fit shown in (a,i). (b): Distribution of 
fitted retention model parameters for the apps in APPSpop* 
The shade represents the frequency of fitted parameter values 
falling into the given bin (darker being more frequent), (c): 
Mean error achieved by the model for apps with fitted param¬ 
eters in the corresponding bin range. The same linear shade 
scale is used for both panels, with the lightest gray being 10 _5 
and black being 3 (white corresponds to no data), b and Xb de¬ 
note the parameters in the time dependent model, and A and 
x a those in the time indepdendent version. 


of features, many of them are domain-spefic and computationally 
expensive. To study thousands of apps and billions of users, we 
want to propose a model that is easy to compute and highly gener- 
alizable. 

Simplest model: exponential decay 

We start with the population of newly installed users, n(0), and 
assume that at every time step each user has a constant probability 
of leaving, xo. This mechanism gives rise to exponential decay: 

= —xon(t) —^ n{t) = n(0) exp(— xot), (1) 

where n{t) is the number of app users at time t. It turns out that 
this model does not yield a good fit to the data. However, the fit 
improves if we introduce a second parameter to the model by fitting 
from the second day; that is, we fit both for the decay rate, xo, and 
the fraction of users that returned on day 2. With this relaxation 
from fitting day 1, the model becomes 

n{t) =» An(0) exp(-xot). (2) 

It is interesting that the exponential decay model fits the day 2 
and onward trend well while not fitting day 1: it is reasonable to 
expect that there is a discontinuous transition in the probability of 
returning given an install versus an install and a return the following 
day. Day 1 users are dominated by those who use the app exactly 
once, whereas all the other days contain a signal from users who ex¬ 
hibited at least some level of continued interest in the app. Despite 


its ability to capture this distinction, this model is unsatisfactory: 
it entirely ignores the day 1 users, and even in the two-parameter 
version it under-predicts retention at long times (Figure [TJ. 


Introducing simple time dependence 

Instead of assuming that people have a constant probability of leav¬ 
ing at every time step, let us assume that their probability is a simple 
function of time: 


dn(t) 

dt 


— ^n(f) n(t ) — n(0) exp 


x a t 


1 — a 


( 3 ) 


This model allows for the possibility that their likelihood of re¬ 
turning to the app could have a time dependent component, and it 
introduces this time dependence with the addition of only one pa¬ 
rameter. Notice that by setting a = 0 we recover the traditional 
exponential decay model. 

The parameters in this model have an interesting interpretation. 
Smaller values of a indicate that the app users have more momen¬ 
tum, that is, the app has more sticking power. The parameter x a is 
still related to the familiar probability of depature: small x a indi¬ 
cates that users are more likely to continue using the app. 

4.2 Temporal analysis 

After studying the temporal dynamics of individual apps through 
our retention model, we now look for regularities in the temporal 
patterns across multiple apps. We start this analysis by taking a 
random sample of all apps, and clustering their time series of daily 
active users using a k-means algorithm. All the time series are 
normalized by the number of active users on the app’s peak day. 

By varying k, we can get different sets of temporal clusters (see 
Figure |8j. The two clusters generated for k = 2 already capture 
two dominant temporal trajectories of apps: one exhibits a clear 
rise and fall, while the other exhibits a slow but more sustainable 
rise. Note that the slow rise shown in the second cluster here may be 
misleading: as these apps keep growing and we normalize the time 
series by total volume, apps with a bigger user base will appear to 
have fewer fluctuations and slow growth/drop. When k increases, 
other small temporal clusters emerge, but they are not significantly 
different from the two typical ones. 

Then, for all apps that existed on June 1 2013, we compute their 
monthly active users (MAU) on that day and one year out. We plot 




Figure 8: Left: A;-means centroids of DAU for APPS rand, for 
k = 2 and k — 3. The clustering was done on the peak nor¬ 
malized time series of a 100-day observation window. Right: 
K-means scores (mean of distances from nearest centroid) for 
various values of k. Error bars represent 95% confidence in¬ 
tervals. Scatter plot is derivative of scores. Notice that no sta¬ 
tistically significant improvement is gained for k > 2; that is, 
for all k > 2 the score of the resulting clustering is statistically 
significant different from that for k — 2. The scores were eval¬ 
uated using a 75-25 train-test split, the clusters were generated 
with 100 restarts, and L2 was used as the distance metric. 
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Figure 9: MAU of all apps in June 2013 (horizontal) and 
June 2014 (vertical). Left: Number of apps with MAU@U 
and MAU@f 2 corresponding to the specified bins. Right: 
P(MAU@f 2 l MAU@ti) is the empirical probability of an app 
having y users at f 2 given that it had x users at t±. At t 2 we 
observe that apps tend to either continue at the same level of 
popularity as they experienced at ti (bright diagonal) or ex¬ 
hibit a dramatic decrease in popularity (bright band on hori¬ 
zontal axis). Apps that are more populare have greater rates of 
continued success. However, when their popularity drops, the 
collapse tends to be complete. 

those distributions against each other in Figure [9] where the color 
indicates the number of apps with the stated pairing of MAUs. The 
right-hand figure normalizes the columns, so that each bin column 
can be interpreted as the probability of ending up at the indicated 
MAU, given the current position. We can see that apps are most 
likely to stay at approximately the same MAU, but that, especially 
for very popular apps, there is a subpopulation that loses almost 
all of their users. This pattern suggests a natural underlying binary 
prediction problem: given that an app enjoys current success will it 
continue to be as popular one year later? 

5. PREDICTING APP SUCCESS 

In the previous sections we have seen that apps can be described 
in a variety of ways. We began by exploring the relationship be¬ 
tween an individual’s social network and their likelihood of adopt¬ 
ing an app, and in general how app usage is clustered in the social 
network. We also related a user’s likelihood of adopting an app to 
their individual characteristics relative to those of the current app’s 
users. Next we observed that, though overall patterns of adoption 
can be quite complex, an app’s retention properties are well de¬ 
scribed by a simple model with a small set of parameters. And fi¬ 
nally we again saw that while the fine grained activity level for any 
app is complicated, in the long term apps tend to either continue at 
the same activity level or diminish in popularity. 

In each analysis we considered either hundreds or thousands 
of popular apps from this ecosystem, and saw that these various 
low dimensional features had interesting and diverse distributions 
across the population. 

This brings us to our final set of questions: can we use an app’s 
social, demographic, retention, and temporal features to predict 
whether or not it will be successful in the long term? 

5.1 Predicting the longevity of apps 

Note that we have seen empirically that the question of an app’s 
long term success is well approximated by a binary variable (see 
Figures [ 8 ] and [9}. In this subsection and the next, we will consider 
two variations on a binary prediction task. One task is straight¬ 


forward: given a collection of promising apps, we want to predict 
which apps will have persistent success over the next year. The 
other task is based on a pairwise evaluation: to compare a pair of 
similarly popular apps and predict which one will be more success¬ 
ful in the future. 

First, we consider the task of predicting which apps in the entire 
population will continue to be successful. Based on the number of 
active users on June 2014, we label an app as a positive example if 
it has over 50% of the number of active users it had in June 2013, 
and we label it as a negative example if has lost more than 50% 
of its users. This labeling turns out to provide us with a balanced 
class distribution, with the guess-all-positive baseline being 50%. 
For this binary classification task we built and evaluated the model 
by training random forests on apps in APPS pop, where each app 
is represented as a vector of the features in Table [2] 

The prediction performance results are shown in Table[3] and the 
use of all the available features leads to performance above 70% on 
this binary task. We find that the temporal features are the best 
single set of features, with the most important features being the 
median number of users in months 8 and 9 of the 12 month obser¬ 
vation period (June 1 2012-June 1 2013 - see Table [lj. The apps 
that would continue to be successful also had a higher weekly min¬ 
imum; given that the overall popularity of the apps between classes 
is evenly distributed, we interpret this high weekly minimum as a 
signal of stability, and that this stability was a positive predictor. 

Individual user attributes yielded the second highest performance, 
with the most important class features being activity-based ones: 
h ,7 and Iq ,7 ( Ik ,7 is the fraction of app users that were also ac¬ 
tive Facebook users for past k out of 7 days). We observe that for 
k = 0 ,..., 6 , negative examples are correlated with greater val¬ 
ues of Ik, 7 , whereas for £ 7 , 7 , the trend reverses, and the positive 
examples with more users who are active on Facebook every day. 
This means that having users who were also highly active Facebook 
users is a positive indicator of success. 

Among all the retention features, the most important one was the 
fitted parameter xa, which represents the “departure probability” 
in the exponential decay model of users leaving an app. Not sur¬ 
prisingly, we find that the positive examples tend to have lower xa 
than negative examples, indicating that having users who continue 
using the app for an extended period of time (i.e. a lower leaving 
probability) is correlated with the app’s long-term success. 

Finally, the most important structural features were sociality, i.e., 
average user degree, and mean/max number of friends who used the 
app. For the latter two we could not notice any significant differ¬ 
ences between the two classes, but we do notice that high sociality 
is a negative indicator of success. This is likely due to the fact that 
we normalize the sociality measure (p(x\y)) by the popularity of 
the app (p(x )); thus those apps with very high sociality score are 
relatively small, and tend to be the ones situated in a very specific, 
niche market. Indeed, we find that if we consider the separate distri¬ 
butions of the numerator and denominator, we observe thatp(x|y) 
is indistinguishable for the two classes, while p(x) is a positive 
indicator, leaving p{x\y)/p{x) as a negative indicator. 

SIRS model 

In general the task of predicting an app’s time-series trajectory is 
a rich and interesting problem, but the binary nature of trajectories 
that we observed motivated our simplification to the binary pre¬ 
diction task. To explore the potential inherent in modeling richer 
properties of the time series, we also consider a model of app us¬ 
age via a set of interacting reaction diffusion processes, much like 
a chemical reaction. The model we use was proposed by Ribeiro et 
al. |47 |, and falls into the well-known class of SIRS models. We 




Temporal 

med /min/max DAU mo .x 
med /min/max ADAU mo .x 
med /min/max A 2 DAU mo .x 
med /min/max DAU year 
med /min/max A DAUyear 
med /min/max A 2 DAUyear 

A year DAU 

A yea r A DAU 

Ay ea rA 2 DAU 

*WAU, MAU, users, new users 

median, min, max number of daily users in month X of observation 

median, min, max of change in daily users within month X (DAUx— DAUx-i) 

median, min, max of second order change in daily users within month X 

median, min, max of DAUx for X E 1,..., 12 

median, min, max of A DAUx for X E 1,..., 12 

median, min, max of A 2 DAUx for X E 1,..., 12 

med DAU 12 - med DAUi 

med A DAU 12 - med A 2 DAUi 

med A 2 DAU i2 - med A 2 DAUi 

Same statistics as listed for DAU above, considering instead weekly users, monthly users, total users, 
and new users 

Demographic 

Country x / P(Countryx) 
Genderx / P (Genderx) 

Agex /P(Agex) 

Ik, 7 / P(h, 7 ) 

is30 / isnot30 / P(is30) 

Entropy (Country) 

Entropy (Gender) 

Entropy (Age) 

Entropy^) 

Entropy (is 30) 

Number, fraction of users from country X 

Number, fraction of users who stated their gender to be X 

Number, fraction of users who stated their age to be X 

Number, fraction of users who were active on Facebook for k out of 7 days 

Number of users who are / aren’t monthly active Facebook users; fraction of users who were monthly 
active Facebook users 

Entropy of country user distribution: — Z^xgC ountries P(Country x ) log 2 P(Country x ) 

Entropy of gender user distribution: — X^xe Genders P(Genderx) log 2 P(Genderx) 

Entropy of age user probability distribution: — J^xeAges ^(Age^) 1°§2 ^P(Age x ) 

Entropy of l 7 distribution: - J2 ke i,...,7 ^(^, 7 ) log 2 P(l k , 7 ) 

Entropy of is30 distribution: — [P(zs30) log 2 P(zs30) + (1 — P(zs30)) log 2 (l — P(zs30))] 

Retention 

N(t) 

P(t) 

a, x a 

A, xo 

Number of users who returned t days after their first login 

Empirical probability of a user returning t days after their first login 

_ a 

Parameters for best fits of time dependent model: N(t) — N (0 )exp —A - 

[ 1 — a 

Parameters Least squares parameter fits of time independent model: N(t) = AN 

(0)exp[—xot] 

Social 

med / max deg 
med / max using 

p(x\y) 

p(x\y)/p{x) 

median and maximum number of friends of an app user 

median and maximum number of friends of an app user who also use the app 

sociality: empirical probability of having adopted an app given that a friend has, 

i.e. mean fraction of an app user’s friends who also use the app 

relative change in probability of a user adopting an app given that their friend has 

SIRS model 

So 

a 

P 

7 

6 

pred(day k ) 

susceptible population size, i.e. number of Facebook users who are interested at this app 

probability of a non-user adopting the app due to non-social reasons 

probability of a non-user adopting the app through social process 

probability of active user becoming inactive 

probability of in-active user being drawn back by active users 

the DAU prediction at day k for k between 2013-06-01 and 2014-06-01 


Table 2: Features used for training and testing the binary app success prediction tasks. Features were measured for all apps in 
APPS pop (see Table[l}, with the exception of the SIRS model features due to issues of model convergence. 























will briefly describe how we implemented this model, and when 
we return to our underlying prediction task, we will consider the 
predictions and parameters from this model as an additional set of 
features. 

Ribeiro et al. Gz) proposed a model describing the dynamics of 
a webservice of daily activity time series, derived from the clas¬ 
sical epidemic model and comprised of a set of reaction diffusion 
processes. The model is specified by a set of parameters, including 
the estimate of the susceptible population, and the transition proba¬ 
bilities between different states. Ribeiro also outlines a framework 
for fitting these parameters given a window of time series activ¬ 
ity levels, and then uses them to extrapolate and make a long term 
prediction of future activity levels. We implemented a model very 
similar to the one described in G3- We fitted the model using a 
Monte Carlo process using time series from June 2, 2012 to May 
25, 2013 (the same period from which we extract temporal fea¬ 
tures), and used the fitted model to generate predictions between 
May 26, 2013 and May 15, 2014. 

There are two things we note about the SIRS model. First, as 
we try to predict the future of apps from a fixed time point, the 
apps we are studying can be in very different life stages. For exam¬ 
ple, some apps in our dataset had only existed for a short period of 
time by the observation day, and thus have very limited time series 
data to compute a good fit of the SIRS model. Second, some un¬ 
derlying assumptions in the SIRS model, such as the constant rate 
of user adoption through advertisement or word-of-mouth process, 
may not hold in reality. As a result, the model would not converge 
for certain apps, especially the ones that experienced large fluctua¬ 
tions in their lifecycles. 

Nevertheless, we were able to fit over two-thirds of the apps in 
APPS pop- Among them, 1100 apps had reasonable convergence 
and error estimates. We then used both the fitted parameters and the 
predicted time series as our features for this subset of 1100 apps. 
On that subset of 1100 apps, the relative performance of the other 
features sets was the same (all combined features yield the highest 
performance, followed by temporal, then demographic, retention, 
and finally social). 

We find that the features from the SIRS model perform worse 
than the retention features but better than the social features. Thus, 
despite the richness of the time-series modeling made possible by 
the SIR framework, as a feature set it does not perform as well 
as other measures incorporating temporal properties, including the 
retention model from the previous section. 


5.2 Predicting pairwise relative success 

Next we formulate a separate but related prediction task, by con¬ 
structing a pairwise comparison version of predicting app success. 
Given that two apps have approximately the same monthly active 
users at t\ (MAU@£i), and by £2 they had diverged from each 
other, we want to predict at time £1 which app is going to be more 
successful. We evaluate this problem with a variety of thresh¬ 
olds for what we considered “near-” and “long-”term predictions 
of MAU. This prediction task is particularly useful when investi¬ 
gating a set of competitive apps in the same market. Intuitively, it 
is difficult to tell similar apps apart at an early stage jT8]|. However, 
by looking at pairs whose outcomes at £2 are successively farther 
apart, we can control for the difficulty of the task and understand 
when it becomes feasible to predict such divergence. 

For the pairwise prediction task we begin by generating a 50-50 
train-test split between apps, and represent each pair of apps as a 
concatenation of two feature vectors, again using the features from 
Table [2] We then introduce a subtle variation to make the setup 


Feature set 

accuracy 

prec: 

+;- 

recall: 

+;- 

top 2 features: 

{among all}; 

{within class} 

Baseline 

0.50 




All 

0.73 

0.72; 

0.74 

0.74; 

0.72 

med users8 . 
medusersg ’ 

Temporal 

0.71 

0.72; 

0.7 

0.68; 

0.74 

AyearlFAf/, 

minVF AU\i ’ 
med users8 
med usersg 

Demographic 

0.66 

0.64; 

0.68 

0.70; 

0.61 

^6, 7, £ 5 , 7 ; 

16,7, h,7 

Retention 

0.61 

0.59; 

0.64 

0.70; 

0.53 

x a , xa; 

day 2 and 3 returns 

Social 

0.6 

0.59; 

0.61 

0.60; 

0.59 

p(x\y) , 

———, (user degree); 
p(x) 

Mean and max # of 
friends using the app 


Table 3: Prediction performance results for five combinations 
of features. Precision and recall: top and bottom rows are for 
positive and negative classes, respectively. Features are ranked 
by out-of-bag importance estimates while training the random 
forests. We trained the classifier using all the features, and re¬ 
port the most important ones in each category in the top row 
(“among all”)? and train the classifier with only the features in 
each category, and report the top opens in lower row (“within 
class”). 


more relevant to a real-world scenario. The features and labels used 
in the training stage are generated using snapshots of our datasets 
at to and ti = to + 6 months, while those used for testing are 
generated using snapshots at £1 and £2 = £1 + 6 months. This 
simulates the practical scenario of observing the app population at 
£ 1 , learning which characteristics of apps lead to their success, and 
using the learned knowledge to predict the future. 

Two apps are considered to start off as being “comparable” if 
they fall into the same decile at £ 0 /i (train/test), and are consid¬ 
ered “distinct” if they are at least k deciles apart at £i/ 2 (train/test). 
In Figure [lO] we see that prediction accuracy increases monotoni- 
cally with k, and that the best set of features (temporal) ultimately 
yield 75% prediction accuracy. The other most striking feature of 
Figure [lO] is that for most of the threshold window, all the features 
yield approximately the same performance. Each set of features, 
besides demographic, takes a turn at being both the top performer 
and the lowest. The individual feature analysis that we did was 
consistent with the observation that this task is not highly sensi¬ 
tive to the choice of features. To analyze which features could best 
discriminate between positive and negative examples we used the 
two-sided Kolmogorov-Smirnov test to compare the distributions 
of each feature for positive and negative examples. We find that, 
with the exception of a few underpopulated demographic features, 
the Kolmogorov-Smirnov test finds that each feature is distinguish¬ 
able between the negative and positive examples with p -values ex¬ 
tremely close to zero. 

6. RELATED WORK 

Sociologists and economists have long studied the problem of 
product adoption and retention. Early work in this domain fo¬ 
cused on the diffusion of innovations, as people proposed a series 
of mathematical models to describes the adoption of new products 
by consumers, such as the “S-shaped” adoption curve [71 and the 
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Figure 10: Prediction accuracy for the pairwise relative success 
prediction task, as a function of decile threshold, k . 


Bass model 0. These models have been successful in predict¬ 
ing the impact of advertising, especially the effect of advertising 
through mass-media and billboards. Other work has focused on the 
diffusion of innovations and products through social ties fl~9| . With 
the rise of social media and online social networks, there has been 
more and more evidence that the social influence, i.e. the word-of- 
mouth effect, is playing a increasingly important role at driving the 
adoption of products and services mm- 

To understand how products and information spread in social 
networks, most existing work tries to predict the volume of pop¬ 
ularity, such as the the size of online communities [1], the number 
of fans of Facebook pages (22) , and the usage of hashtags on Twit¬ 
ter |2|20|25| . While these work showed the correlation between the 
scale of diffusion and its structural and topical properties, there has 
been a recent line of work questioning the predictability of large 
viral events j2|[2T]. In response, Cheng et al. |5j showed that it is 
possible to predict how much more a cascade will grow by observ¬ 
ing the temporal and other features of its spread up to the present 
time. 

Besides being a key predictor for cascade size, the temporal dy¬ 
namics of cascades have been an interesting research topic m 
[28]. Upon the discovery of several robust temporal classes of cas¬ 
cades on different platforms, most studies on the temporal dynam¬ 
ics focused on bursty events |24] , or the peak volume (6]|T3]. In¬ 
deed, the majority of popular things spread on-line enjoy very short 
attention span: the popularity rises and drop quickly, usually within 
a few hours or a day |26[|28| . The persistence of interest, although 
rare, is rather intriguing. Wu et al. |26) found that the longevity of 
URLs on Twitter can be explained by the intrinsic cultural value of 
the content they link to. Follow-up work showed that information 
with positive sentiment is more likely to persist (27) Ducheneaut et 
al.. discovered that smaller and denser guilds in World of Warcraft 
are more likely to survive longer (9). 

While many papers correlate the temporal patterns of cascades 
with its empirical properties, some researchers have developed the¬ 
oretical models on individuals’ choice of adopting and engaging 
with a product or activity (17] [24). These models are useful at 
depicting the mechanism behind the observed temporal dynamics, 
however, it is unclear how generalizable they are beyond the par¬ 
ticular product or activity studied. 

Our work contributes to current research in two major aspects. 

First, we study the entire lifecycle of apps over a timespan as 
long as 5 years . Our focus is the persistency of growth other than 
the peak popularity. Different from a viral YouTube video or a 
meme photo, successful apps needs to engage with their users re¬ 


peatedly. Therefore, we spent a significant amount of work analyz¬ 
ing and modeling the retention of apps, and showed its importance 
to the long-term success of apps. 

Second, we study thousands of apps at once. Previously, most 
papers examined the adoption and retention of a single product/activity, 
thus their results might not be generalizable to other domains. By 
studying a large selection of apps on Facebook, we are able to con¬ 
trol for app-specific features and understand how the characteristics 
of an app interact with its social and temporal dynamics. 

Some work similar to ours includes a recent study of the growth 
and longevity of online communities fTO] , the modeling and pre¬ 
diction of the temporal pattern of membership-based websites CD 
and a study of mobile app adoption over a small real-life social net¬ 
work fl6) . Our study builds on these papers in both the scope and 
the variety of examples examined. Also, with the rich dataset we 
have about apps, users, and the underlying social graph, we are able 
to introduce several new theoretical and analytical models, and to 
compare them with recent formalisms CD- By incorporating the 
parameters of a fitted model as part of the feature set, we are able 
to extend and compare different methodologies. 


7. CONCLUSION 

In this paper we studied the lifecycle of apps: as they grow and 
thrive, and, in some cases, as they decline. We studied differences 
in their development, looking for clues to their future fate. First, 
we sought parameters with which to model the interaction between 
the app and the individual. We found that a simple exponential 
decay, even with an adjustment for attrition after the first day of 
use, did not accurately capture user retention. Instead, those who 
keep using the app over a longer time period are less and less likely 
to stop. Modeling retention of individuals in this way is helpful in 
predicting app success. 

Another dimension goes beyond the individual to whether the 
app is adopted socially. Apps vary widely in the sociality of their 
adoption, and we find heterogeneity in the apps based on how their 
adoption probabilities depend on the connectedness of friends who 
use the app and the similarities in attributes between an adopter and 
his or her friends. 

The features most predictive of an app’s future dynamics are 
those describing its past growth trajectory. More widely adopted 
apps that have recently been on a growth trajectory are more likely 
to persist. Given a range of features, we obtain over 20% abso¬ 
lute improvement over random guessing when it comes to making 
a binary prediction as to sustained activity for an app. We also ob¬ 
tain strong performance when we formulate the problem as one of 
matching two apps of roughly equal size which take different tra¬ 
jectories, and trying to distinguish the two with a much higher than 
random accuracy. 

There are a number of further aspects of the app ecosystem that 
would be interesting to take into account in future work. First, app 
adoption is driven in part by the marketing and other recruitment 
strategies of the app owners. Although our models incorporate the 
numbers of new users coming to the app over time, they do not dif¬ 
ferentiate between organic growth and advertising-driven growth. 
Furthermore, it is not clear whether sociality of apps might accel¬ 
erate growth or decline or both. Finally, it is unclear whether some 
features might be early harbingers of future behavior, e.g. whether 
the change in retention of long-time or recently acquired users is 
more useful in forecasting the eventual adoption of the app. We 
leave these and other questions for future work. 
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